Similarity-Based Alignment and Generalization

نویسندگان

  • Daniel Oblinger
  • Vittorio Castelli
  • Tessa A. Lau
  • Lawrence D. Bergman
چکیده

We present a novel approach to learning predictive sequential models, called similarity-based alignment and generalization, which incorporates in the induction process a specific form of domain knowledge derived from a similarity metric of the points in the input space. When applied to Hidden Markov Models, our framework yields a new class of learning algorithms called SimAlignGen. We discuss the application of our approach to the problem of programming by demonstration–the problem of learning a procedural model of a user’s behavior by observing the interaction an application GUI. We describe in detail the SimIOHMM, a specific instance of SimAlignGen that extends the known Input-Output Hidden Markov Model (IOHMM). We use the SimIOHMM in empirical evaluations that demonstrates the dependence of the prediction accuracy on the introduced similarity bias, as well as the computational gains over the IOHMM.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A generalization of Profile Hidden Markov Model (PHMM) using one-by-one dependency between sequences

The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA) which is the representative of a PHMM can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on th...

متن کامل

Similarity-Based Alignment and Generalization: A New Paradigm for Programming by Demonstration

We present an approach to learning procedural knowledge by demonstration called similarity-based alignment and generalization. Key to our approach is the ability to induce complex procedure structure (loops and conditional branches) by aligning multiple unannotated demonstrations of a procedure. We present an implemented instance of a similarity-based alignment and generalization algorithm that...

متن کامل

Hybrid DNA Sequence Similarity Scheme for Training Support Vector Machines

Similarity between two DNA sequences is based on alignment. There are different approaches of alignments; each has its own specialty of bearing different information on DNA sequence. This paper presents a study on similarity kernels based on different similarity schemes and proposes a hybrid one. Similarity Kernel is required in order to represent the distance or similarity between two DNA sequ...

متن کامل

Identification of BKCa channel openers by molecular field alignment and patent data-driven analysis

In this work, we present the first comprehensive molecular field analysis of patent structures on how the chemical structure of drugs impacts the biological binding. This task was formulated as searching for drug structures to reveal shared effects of substitutions across a common scaffold and the chemical features that may be responsible. We used the SureChEMBL patent database, which prov...

متن کامل

Bayesian Alignment of Similarity Shapes.

We develop a Bayesian model for the alignment of two point configurations under the full similarity transformations of rotation, translation and scaling. Other work in this area has concentrated on rigid body transformations, where scale information is preserved, motivated by problems involving molecular data; this is known as form analysis. We concentrate on a Bayesian formulation for statisti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005